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Abstract 

We present analytical and numerical results of modeling of flows represented 
as the correlated non-Poissonian point process and as the Poissonian sequence 
of pulses of the different size. Both models may generate signals with the power- 
law distributions of the intensity of the flow and the power-law spectral density. 
Furthermore, different distributions of the interevent time of the point process 
and different statistics of the size of pulses may result in l/f@ noise with 0.5 < 
(3 < 2. Combination of the models is applied for modeling of the Internet traffic. 

1 Introduction 

Modeling and simulations enable one to understand and explain the observable phe- 
nomena and predict the new ones. This is true, as well, for the mathematical study and 
modeling of the traffic flow with the aim to get a better understanding of phenomena 
and avoid some problems of traffic congestion. Traffic phenomena are complex and 
nonlinear, they show cluster formation, huge fluctuations and long-range dependen- 
cies. Almost forty years ago from the empirical data it was detected that fluctuations 
of a traffic current on a expressway obey the 1/f law for low spectral frequencies [1]. 
Similarly 1/f noise is observable in the flows of granular materials [2,3]. 

1/f noise, or 1/f fluctuations are usually related with the power-law distributions 
of other statistics of the fluctuating signals, first of all with the power-law decay of 
autocorrelations and the long-memory processes (see, e.g., comprehensive bibliography 
of 1/f noise in the website [4], review articles [5,6] and references in the recent paper 
[7]). The appearance of the clustering and large fluctuations in the traffic and granular 
flows may be a result of synchronization of the nonlinear systems with stopping and 
driven by common noise, resulting in the nonchaotic behavior of the Brownian-type 
motions, intermittency and 1/f noise [8,9]. 

The traffic and granular flows usually may be considered as those consisting of 
the discrete identical objects such as vehicles, pedestrians, granules, packets and so 
on, they may be represented as consisting of pulses or elementary events and further 
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simplified to the point process model [7,10-12]. Moreover, from the modeling of the 
traffic it was found that 1/f noise may be the result of clustering and jumping [10] 
similar to the point process model of 1/f noise [7, 11, 12]. 

On the other hand, 1/f noise may be conditioned by the flow consisting of uncorre- 
lated pulses of variable size with the power-law distribution of the pulse durations [13]. 
In the Internet traffic the flow of the signals primarily is composed of the power-law 
distributed file sizes. The files are divided by the network protocol into the equal pack- 
ets [14]. Therefore, the total incoming web traffic is a sequence of the packets arising 
from large number of requests. Such a flow exhibits 1/f fluctuations, as well [14,15]. 

The long-range correlations and the power-law fluctuations in the wide range of the 
time scale from minutes to months of the expressway traffic flow have recently been 
observed and investigated using the method of the detrended fluctuation analysis [16]. 
There are no explanations why the traffic flow exhibit 1/f noise behavior in such a 
large interval of the time. 

It is the purpose of this paper to present analytical and numerical results for the 
modeling of flows represented as sequences of different pulses and as a correlated 
non-Poissonian point process resulting in 1/f noise and to apply these results to the 
modeling of the Internet traffic. 

2 Signal as a sequence of pulses 

We will investigate a signal of flow consisting of a sequence of pulses, 

I{t)=^A k (t-t k ). (1) 

k 

Here the function Af.(t — %) represents the shape of the k pulse having influence on 
the signal I(t) in the region of time 

2.1 Power spectral density 

The power spectral density of the signal (Q) can be written as 

tf-t k t f -t k , 

S(/)=Hm /|X) e " (tfc ~ V) / / Mn)A kl {u')e^ u ' u ">dudu'Y (2) 

where u = 2irf, T — tf — ti ^> oj~ 1 is the observation time and the brackets (. . .) 
denote the averaging over realizations of the process. We assume that pulse shape 
functions A k (u) decrease sufficiently fast when \u\ — > oo. Since T — > oo, the bounds 
of the integration in Eq. (J2J) can be changed to ±oo. 

When the time moments t k are not correlated with the shape of the pulse Aj., the 
power spectrum is [2] 

S(f)=l\m^J2( eMtk ' tk,) ){ I I A k (u)A k ,(u')e^ u -^dudu'). (3) 

k,k> 
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After introduction of the functions [13] 

-00 +00 



and 

XkA") = <e iw(tfc - V) > (5) 

the spectrum can be written as 

2 x 

S(f) = lim - Vxy'(w)*t,fc'(w). (6) 

J — >oo ± ' ' 

k,k' 

2.2 Stationary process 

Equation (JHJ) can be further simplified for the stationary process. Then all averages 
can depend only on k — k f , i.e., 



and 

Equation © then reads 



* fc)fc /(w) = *fc_ fc 'H (7) 
Xfc,fc'( w ) = Xfe-ik'(^)- (8) 



= lim I V ^(wjft.^w). (9) 

k,k' 

Introducing a new variable q = k — k' and changing the order of summation, yield 

&max~&min ^max <? 



9 1 &mui 



2 1 ^max ^ ^max 

^min ^max & — &min 9 & = &min 

Here /c m i n and fc max are minimal and maximal values of the index k in the interval of 
observation T. Eq. (fTUj) may be simplified to the structure 

N 

S(f) = 2m{u) +Hm 4^T (p- |) HexgH* ff (w) (11) 

g=l 

where v is the mean number of pulses per unit time and N = k max — k min is the number 
of pulses in the time interval T. 

If the sum j, J2q=i Q^Xqi^^qi^) ~ > when T — > 00, then the second term in 
the sum vanishes and the spectrum is 

00 00 
S(f) = 2P* (w) + 4P^Rex,M^M = 2^ ^ (12) 

q=l g=— 00 
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2.3 Fixed shape pulses 

When the shape of the pulses is fixed (^-independent) then the function *&k,k'{w) does 
not depend on k and k' and, therefore, *&k,k'{w) — ^0,0 (^)- Then equation © yields 
the power spectrum 



S(f) = #o,o(w) lim — } Xk,k'{u) = Vo,o(u)S s {u) 

1 — >oo ± 



(13) 



fe,fc' 



Eq. (|T3*j) represents the spectrum of the process as a composition of the spectrum of 
one pulse, 



#0,1 



A k (t)e lu)t dt 



and the power density spectrum Ss(uj) of the point process 

I s (t) =aJ2$(t-tk) 
k 

with the area of the pulse a = 1. 



(14) 



(15) 



3 Stochastic point processes 

The shapes of the pulses mainly influence the high frequency power spectral density, 
i.e., at uj > 1/Atp, with At p being the characteristic pulse length. Therefore the power 
spectral density at low frequencies for the not very long pulses is mainly conditioned 
by the correlations between the transit times t k , i.e., the signal may be approximated 
by the point process. 

The point process model of l//' 3 noise has been proposed [11,12], generalized [7], 
analysed and used for the financial systems [17]. It has been shown that when the 
average interpulse, interevent, interarrival, recurrence or waiting times = t^+i — th 
of the signal diffuse in some interval, the power spectrum of such process may exhibit 
the power-law dependence, S$(f) ~ 1/ f 13 , with 0.5 < (3 < 2. The distribution density 
of the signal (JT5J) intensity defined as / = l/r k may be of the power-law, P(I) ~ /~ A , 
with 2 ^ A ^ 4, as well. The exponents (3 and A are depending on the manner of 
diffusion-like motion of the interevent time r k and, e.g., for the multiplicative process 
are interrelated [7,17]. For the pure multiplicative process [7] 

/3=l + a, A = 3 + a, (16) 

where a is the exponent of the power-law distribution, Pfc( r fc) ~ Tj*, of the interevent 
time. In general, for relatively slow fluctuations of r^, the distribution density of the 
flow /, 

p(i) ~ p k (r l )r 3 , (17) 

is mostly conditioned by the multiplier J~ 3 . As far as the point process model has 
recently [7,17] been analysed rather properly here we will not repeat the analysis and 
only present some new illustrations. 
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Figure 1: Distribution densities of the interevent time r k , (a), of the flow I(t), (b), and 
of the power spectra S(f), (c), for different point processes with the slow diffusion-like 
motion of the average interevent time. Different symbols correspond to different types 
of the generation of the interevent sequences. 



Figure H] demonstrates that for essentially different distributions of 7>, the power 
spectra and distribution densities of the point processes are similar. 

Further we proceed to the flow consisting of the pulses of different durations and 
application of this approach for modeling of the Internet traffic. 

4 Flow consisting of pulses of variable duration 

When the occurrence times t k of the pulses are uncorrelated and distributed according 
to the Poisson process, the power spectrum of the random pulse train is given by the 
Carlson's theorem 

S(f)=2u(\F k (uj)\ 2 ), (18) 

where 

+oo 

F k (u) = f A k (t)e iu)t dt (19) 

— oo 

is the Fourier transform of the pulse A k . Suppose that the random parameters of the 
pulses are the duration and the area (integral) of the pulse. We can take the form of 
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the pulses as 

Mt-tk)=WA(^y (20) 

where is the characteristic duration of the pulse. The value of the exponent p = 
corresponds to the fixed height but different durations, the telegraph-like pulses, 
whereas p = — 1 corresponds to constant area pulses but of different heights and 
durations, and so on. 

For the power-law distribution of the pulse durations, 

{ 5+1 rpS rp < T, < T 

Jmax i min (21) 
0, otherwise, 

from Eqs. (fTHj) and (pHJj) we have the spectrum 

S(f)= - 2V{ -*t l \ I \F(u)\ 2 u 8+2 » +2 du. (22) 



fTtf+l _ 7"»+lW+2p+3 
V max min/ 

Ml m in 

For t~I x <C to <C r m l n when <5 > — 1 the expression (J22|) may be approximated as 

oo 

« 2P( ^ 1) , / |F(u)|V +2 " +2 du. (23) 

yJ ' fT^a+l _ 7"5+l\ 5+2/9+3 / V 7 

V max min/ ^ 

Therefore, the random pulses with the appropriate distribution of the pulse duration 
(and area) may generate signals with the power-law distribution of the spectrum with 
different slopes. So, the pure 1/ / noise generates, e.g., the fixed area (p = —1) with 
the uniform distribution of the durations (5 = 0) sequences of pulses, the fixed height 
(p = 0) with the uniform distribution of the inverse durations 7 = T^ 1 and all other 
sequences of random pulses satisfying the condition 5 + 2p = —2. 
In such a case from Eq. ()23|) we have 

S(f) - } 6 + 1) L • ( 24 ) 

V max min/J 



5 The Internet traffic 

In this Section we will apply the results of the Section 0] for modeling the Internet 
traffic. The incoming traffic consists of sequence of packets, which are the result of 
the division of the requested files by the network (TCP) protocol. Maximum size of 
the packet is 1500 bytes. Therefore, the information signal is as the point process (JT5j) 
with the pulse area a = 1500 bytes. Further, we will analyse the flow of the packets 
and will measure the intensity of the flow in packets per second. In such a system of 
units in Eq. (|T5)l we should put a — 1. 

We exploit the empirical observation [14, 18] that the distribution of the file sizes 
x may be described by the positive Cauchy distribution 

P{x) = -^— 2 (25) 

7T S + X 2 
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with the empirical parameter s = 4100 bytes. This distribution asymptotically exhibits 
the Pareto distribution and follows the Zipf's law P (X > x) ~ 1/x. The files are 
divided by the network protocol into packets of the maximum size of 1500 bytes or 
less. In the Internet traffic the packets spread into the Poissonian sequence with the 
average inter-packet time t p (see Fig. |2J). The total incoming flow of the packets to 
the server consists of the packets arising from the Poissonian request of the files with 
the average interarrival time of files t/. 

The files are requested from different servers located at different distance. This 
results in the distribution of the average inter-packet time r p in some interval. For 
reproduction of the empirical distribution of the interpacket time we assume the 
uniform distribution of lg in some interval \Tk,mim Tk,max] , similarly to the McWhorter 
model of 1/ f noise [7). As a result, the presented model reproduces sufficiently well the 
observable non-Poissonian distribution of the arrival interpacket times and the power 
spectral density, as well (see Fig. EJ). 
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Figure 2: Division of the requested files into equal size packets with some inter-packet 
time. 



6 Conclusion 

In the paper it was shown that the processes exhibiting 1/ / noise and the power-law 
distribution of the intensity may be generated starting from the signals as sequences 
of constant area pulses with the correlated appearance times as well as of different 
size Poissonian pulses. Combination of both approaches enables the modeling of the 
signals in the Internet traffic. 
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Figure 3: Distribution densities of the interpacket time T&, (a), and the power 
spectra, (b), for the simulated point process (open circles) and the empirical data 
(open squares). The used parameters are as in the empirical data [14, 18], r/ = 
0.101s, r Kmin = 11.6/is and T Kmax = 1000 T Kmin 
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